Overview

Dataset statistics

Number of variables37
Number of observations224281
Missing cells2681283
Missing cells (%)32.3%
Duplicate rows818
Duplicate rows (%)0.4%
Total size in memory63.3 MiB
Average record size in memory296.0 B

Variable types

DateTime1
Categorical29
Numeric5
Unsupported2

Warnings

Estado has constant value "CONSOLIDADO" Constant
Dataset has 818 (0.4%) duplicate rows Duplicates
Título has a high cardinality: 5592 distinct values High cardinality
Nome do programa has a high cardinality: 5153 distinct values High cardinality
Data de publicação has a high cardinality: 804 distinct values High cardinality
Série has a high cardinality: 264 distinct values High cardinality
Áreas temáticas has a high cardinality: 4314 distinct values High cardinality
Etapas de ensino has a high cardinality: 445 distinct values High cardinality
Faixa etária has a high cardinality: 277 distinct values High cardinality
Término da vigÊncia has a high cardinality: 56 distinct values High cardinality
Número do programa is highly correlated with Número do episódioHigh correlation
Número do episódio is highly correlated with Número do programaHigh correlation
Nome do programa has 74964 (33.4%) missing values Missing
Número do programa has 74964 (33.4%) missing values Missing
Data de publicação has 11580 (5.2%) missing values Missing
Status de visualização has 179340 (80.0%) missing values Missing
Série has 4899 (2.2%) missing values Missing
Número do episódio has 73157 (32.6%) missing values Missing
Áreas temáticas has 20328 (9.1%) missing values Missing
Etapas de ensino has 12855 (5.7%) missing values Missing
Públicos-alvo has 17325 (7.7%) missing values Missing
MECFlix has 65028 (29.0%) missing values Missing
MEC RED has 129300 (57.7%) missing values Missing
Disp. TV Escola Crianças has 129300 (57.7%) missing values Missing
Inédito has 65028 (29.0%) missing values Missing
LIBRAS has 65028 (29.0%) missing values Missing
Função do vídeo has 68190 (30.4%) missing values Missing
Versão brasileira has 206208 (91.9%) missing values Missing
Classificação indicativa has 69772 (31.1%) missing values Missing
Tipo de produção has 69845 (31.1%) missing values Missing
Ano de produção has 72511 (32.3%) missing values Missing
País de origem has 73518 (32.8%) missing values Missing
Data primeira exibição has 223197 (99.5%) missing values Missing
Faixa etária has 87735 (39.1%) missing values Missing
Término da vigÊncia has 181255 (80.8%) missing values Missing
Visualização sem autenticação has 65028 (29.0%) missing values Missing
Licença TV has 65028 (29.0%) missing values Missing
Licença streaming has 65028 (29.0%) missing values Missing
Licença VoD has 65028 (29.0%) missing values Missing
Finalidade do vídeo has 222742 (99.3%) missing values Missing
Tipo de vídeo has 223102 (99.5%) missing values Missing
Número do programa is highly skewed (γ1 = 36.03570618) Skewed
Número do episódio is highly skewed (γ1 = 36.26530462) Skewed
Ano de produção is highly skewed (γ1 = 47.91390264) Skewed
Visualizações is highly skewed (γ1 = 369.0234795) Skewed
Data de registro is an unsupported type, check if it needs cleaning or further analysis Unsupported
Duração is an unsupported type, check if it needs cleaning or further analysis Unsupported
Número do programa has 7188 (3.2%) zeros Zeros
Número do episódio has 7278 (3.2%) zeros Zeros
Visualizações has 111000 (49.5%) zeros Zeros

Reproduction

Analysis started2021-05-03 20:12:24.255408
Analysis finished2021-05-03 20:13:49.929080
Duration1 minute and 25.67 seconds
Software versionpandas-profiling v2.12.0
Download configurationconfig.yaml

Variables

Data
Date

Distinct69
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.7 MiB
Minimum2014-01-01 00:00:00
Maximum2020-01-01 00:00:00
Histogram with fixed size bins (bins=50)

Mês
Categorical

Distinct12
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.7 MiB
Janeiro
27154 
Dezembro
20264 
Novembro
20218 
Outubro
20105 
Julho
19785 
Other values (7)
116755 

Length

Max length9
Median length7
Mean length6.476803653
Min length4

Characters and Unicode

Total characters1452624
Distinct characters25
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowJaneiro
2nd rowJaneiro
3rd rowJaneiro
4th rowJaneiro
5th rowJaneiro
ValueCountFrequency (%)
Janeiro27154
12.1%
Dezembro20264
9.0%
Novembro20218
9.0%
Outubro20105
9.0%
Julho19785
8.8%
Março19685
8.8%
Abril19685
8.8%
Fevereiro17605
7.8%
Maio14945
6.7%
Agosto14945
6.7%
Other values (2)29890
13.3%
Histogram of lengths of the category
ValueCountFrequency (%)
janeiro27154
12.1%
dezembro20264
9.0%
novembro20218
9.0%
outubro20105
9.0%
julho19785
8.8%
março19685
8.8%
abril19685
8.8%
fevereiro17605
7.8%
setembro14945
6.7%
agosto14945
6.7%
Other values (2)29890
13.3%

Most occurring characters

ValueCountFrequency (%)
o239759
16.5%
r177266
12.2%
e170605
11.7%
b95217
 
6.6%
i79389
 
5.5%
u74940
 
5.2%
J61884
 
4.3%
a61784
 
4.3%
m55427
 
3.8%
t49995
 
3.4%
Other values (15)386358
26.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter1228343
84.6%
Uppercase Letter224281
 
15.4%

Most frequent character per category

ValueCountFrequency (%)
o239759
19.5%
r177266
14.4%
e170605
13.9%
b95217
 
7.8%
i79389
 
6.5%
u74940
 
6.1%
a61784
 
5.0%
m55427
 
4.5%
t49995
 
4.1%
n42099
 
3.4%
Other values (7)181862
14.8%
ValueCountFrequency (%)
J61884
27.6%
M34630
15.4%
A34630
15.4%
D20264
 
9.0%
N20218
 
9.0%
O20105
 
9.0%
F17605
 
7.8%
S14945
 
6.7%

Most occurring scripts

ValueCountFrequency (%)
Latin1452624
100.0%

Most frequent character per script

ValueCountFrequency (%)
o239759
16.5%
r177266
12.2%
e170605
11.7%
b95217
 
6.6%
i79389
 
5.5%
u74940
 
5.2%
J61884
 
4.3%
a61784
 
4.3%
m55427
 
3.8%
t49995
 
3.4%
Other values (15)386358
26.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII1432939
98.6%
None19685
 
1.4%

Most frequent character per block

ValueCountFrequency (%)
o239759
16.7%
r177266
12.4%
e170605
11.9%
b95217
 
6.6%
i79389
 
5.5%
u74940
 
5.2%
J61884
 
4.3%
a61784
 
4.3%
m55427
 
3.9%
t49995
 
3.5%
Other values (14)366673
25.6%
ValueCountFrequency (%)
ç19685
100.0%

Ano
Real number (ℝ≥0)

Distinct7
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2016.811678
Minimum2014
Maximum2020
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 MiB

Quantile statistics

Minimum2014
5-th percentile2014
Q12015
median2017
Q32018
95-th percentile2019
Maximum2020
Range6
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.762153462
Coefficient of variation (CV)0.0008737322782
Kurtosis-1.170408414
Mean2016.811678
Median Absolute Deviation (MAD)1
Skewness-0.2085579051
Sum452332540
Variance3.105184825
MonotocityNot monotonic
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
201850040
22.3%
201939552
17.6%
201739288
17.5%
201532520
14.5%
201432508
14.5%
201624984
11.1%
20205389
 
2.4%
ValueCountFrequency (%)
201432508
14.5%
201532520
14.5%
201624984
11.1%
201739288
17.5%
201850040
22.3%
ValueCountFrequency (%)
20205389
 
2.4%
201939552
17.6%
201850040
22.3%
201739288
17.5%
201624984
11.1%

Título
Categorical

HIGH CARDINALITY

Distinct5592
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Memory size1.7 MiB
Episódio 1
 
312
Episódio 2
 
312
Episódio 3
 
247
Episódio 4
 
243
Episódio 5
 
195
Other values (5587)
222972 

Length

Max length138
Median length24
Mean length28.53451251
Min length1

Characters and Unicode

Total characters6399749
Distinct characters121
Distinct categories17 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique67 ?
Unique (%)< 0.1%

Sample

1st rowTESTE TVoD upload via base integradora
2nd rowTeste TVoD Habitantes de Babel
3rd rowMúsica 005
4th rowMúsica 005
5th rowAlimentação - Com libras
ValueCountFrequency (%)
Episódio 1312
 
0.1%
Episódio 2312
 
0.1%
Episódio 3247
 
0.1%
Episódio 4243
 
0.1%
Episódio 5195
 
0.1%
Episódio 12189
 
0.1%
Episódio 11189
 
0.1%
Episódio 10189
 
0.1%
Episódio 13180
 
0.1%
Comunicação159
 
0.1%
Other values (5582)222066
99.0%
Histogram of lengths of the category
ValueCountFrequency (%)
71161
 
6.5%
e38008
 
3.4%
de36962
 
3.4%
o25962
 
2.4%
a25170
 
2.3%
da23573
 
2.1%
do22624
 
2.1%
programa18850
 
1.7%
libras17141
 
1.6%
com15426
 
1.4%
Other values (6493)807792
73.3%

Most occurring characters

ValueCountFrequency (%)
883166
 
13.8%
a638062
 
10.0%
o477895
 
7.5%
e473352
 
7.4%
i357323
 
5.6%
r344243
 
5.4%
s310261
 
4.8%
n239984
 
3.7%
t229987
 
3.6%
d222941
 
3.5%
Other values (111)2222535
34.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter4593961
71.8%
Space Separator883166
 
13.8%
Uppercase Letter631758
 
9.9%
Decimal Number118621
 
1.9%
Dash Punctuation82764
 
1.3%
Other Punctuation52839
 
0.8%
Open Punctuation13118
 
0.2%
Close Punctuation13118
 
0.2%
Control8010
 
0.1%
Other Letter997
 
< 0.1%
Other values (7)1397
 
< 0.1%

Most frequent character per category

ValueCountFrequency (%)
a638062
13.9%
o477895
10.4%
e473352
10.3%
i357323
 
7.8%
r344243
 
7.5%
s310261
 
6.8%
n239984
 
5.2%
t229987
 
5.0%
d222941
 
4.9%
m176695
 
3.8%
Other values (30)1123218
24.4%
ValueCountFrequency (%)
E62418
 
9.9%
P59544
 
9.4%
A55372
 
8.8%
C54374
 
8.6%
M45261
 
7.2%
O40274
 
6.4%
S34838
 
5.5%
R26694
 
4.2%
D24079
 
3.8%
T24047
 
3.8%
Other values (29)204857
32.4%
ValueCountFrequency (%)
,25359
48.0%
:10823
20.5%
.4615
 
8.7%
?4440
 
8.4%
/3010
 
5.7%
!2747
 
5.2%
"726
 
1.4%
'702
 
1.3%
&320
 
0.6%
#57
 
0.1%
Other values (2)40
 
0.1%
ValueCountFrequency (%)
126625
22.4%
221921
18.5%
014401
12.1%
312964
10.9%
410124
 
8.5%
57727
 
6.5%
66448
 
5.4%
76409
 
5.4%
86006
 
5.1%
95996
 
5.1%
ValueCountFrequency (%)
-75853
91.6%
5994
 
7.2%
917
 
1.1%
ValueCountFrequency (%)
º644
64.6%
ª353
35.4%
ValueCountFrequency (%)
+506
85.8%
|84
 
14.2%
ValueCountFrequency (%)
205
86.1%
33
 
13.9%
ValueCountFrequency (%)
280
89.5%
33
 
10.5%
ValueCountFrequency (%)
³21
50.0%
²21
50.0%
ValueCountFrequency (%)
883166
100.0%
ValueCountFrequency (%)
(13118
100.0%
ValueCountFrequency (%)
)13118
100.0%
ValueCountFrequency (%)
8010
100.0%
ValueCountFrequency (%)
́114
100.0%
ValueCountFrequency (%)
´4
100.0%
ValueCountFrequency (%)
$96
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin5226716
81.7%
Common1172919
 
18.3%
Inherited114
 
< 0.1%

Most frequent character per script

ValueCountFrequency (%)
a638062
 
12.2%
o477895
 
9.1%
e473352
 
9.1%
i357323
 
6.8%
r344243
 
6.6%
s310261
 
5.9%
n239984
 
4.6%
t229987
 
4.4%
d222941
 
4.3%
m176695
 
3.4%
Other values (71)1755973
33.6%
ValueCountFrequency (%)
883166
75.3%
-75853
 
6.5%
126625
 
2.3%
,25359
 
2.2%
221921
 
1.9%
014401
 
1.2%
(13118
 
1.1%
)13118
 
1.1%
312964
 
1.1%
:10823
 
0.9%
Other values (29)75571
 
6.4%
ValueCountFrequency (%)
́114
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII6168594
96.4%
None223579
 
3.5%
Punctuation7462
 
0.1%
Diacriticals114
 
< 0.1%

Most frequent character per block

ValueCountFrequency (%)
883166
14.3%
a638062
 
10.3%
o477895
 
7.7%
e473352
 
7.7%
i357323
 
5.8%
r344243
 
5.6%
s310261
 
5.0%
n239984
 
3.9%
t229987
 
3.7%
d222941
 
3.6%
Other values (72)1991380
32.3%
ValueCountFrequency (%)
ã58619
26.2%
ç48112
21.5%
á24279
10.9%
í19171
 
8.6%
ó19158
 
8.6%
é13225
 
5.9%
ê11591
 
5.2%
ú7933
 
3.5%
â4404
 
2.0%
õ3810
 
1.7%
Other values (22)13277
 
5.9%
ValueCountFrequency (%)
5994
80.3%
917
 
12.3%
280
 
3.8%
205
 
2.7%
33
 
0.4%
33
 
0.4%
ValueCountFrequency (%)
́114
100.0%

Nome do programa
Categorical

HIGH CARDINALITY
MISSING

Distinct5153
Distinct (%)3.5%
Missing74964
Missing (%)33.4%
Memory size1.7 MiB
UN21REV2013VOD
 
90
EK026L EPS12 VOD
 
90
ED01REV2014VOD
 
90
OPDE001 CORRER
 
78
SF2017080 VOD
 
66
Other values (5148)
148903 

Length

Max length48
Median length22
Mean length21.62536751
Min length6

Characters and Unicode

Total characters3229035
Distinct characters44
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique70 ?
Unique (%)< 0.1%

Sample

1st rowTVOD_Teste1
2nd rowTVOD_Teste2
3rd rowSGE MUSICA BL1 SEX
4th rowSGE MUSICA BL2 SEX
5th rowEK007L EPS7 VOD
ValueCountFrequency (%)
UN21REV2013VOD90
 
< 0.1%
EK026L EPS12 VOD90
 
< 0.1%
ED01REV2014VOD90
 
< 0.1%
OPDE001 CORRER78
 
< 0.1%
SF2017080 VOD66
 
< 0.1%
NA3004 PUPUNHA VOD66
 
< 0.1%
KIWI002 VAMOS A PRAIA45
 
< 0.1%
UN09REV2013VOD45
 
< 0.1%
ATIV003 BASQUETEBOL 145
 
< 0.1%
EACE EDUC AMBIENTAL VOD45
 
< 0.1%
Other values (5143)148657
66.3%
(Missing)74964
33.4%
Histogram of lengths of the category
ValueCountFrequency (%)
vod59686
 
11.5%
e8490
 
1.6%
n6389
 
1.2%
spt5158
 
1.0%
noticias5100
 
1.0%
fil4755
 
0.9%
a4710
 
0.9%
de3879
 
0.7%
episodio3303
 
0.6%
o3043
 
0.6%
Other values (9451)414607
79.9%

Most occurring characters

ValueCountFrequency (%)
370020
 
11.5%
O235561
 
7.3%
A235343
 
7.3%
E214964
 
6.7%
0214846
 
6.7%
D163008
 
5.0%
I157787
 
4.9%
S133209
 
4.1%
R124881
 
3.9%
1108787
 
3.4%
Other values (34)1270629
39.4%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter2254909
69.8%
Decimal Number603496
 
18.7%
Space Separator370020
 
11.5%
Connector Punctuation341
 
< 0.1%
Lowercase Letter269
 
< 0.1%

Most frequent character per category

ValueCountFrequency (%)
O235561
 
10.4%
A235343
 
10.4%
E214964
 
9.5%
D163008
 
7.2%
I157787
 
7.0%
S133209
 
5.9%
R124881
 
5.5%
M107382
 
4.8%
C103538
 
4.6%
T102248
 
4.5%
Other values (18)676988
30.0%
ValueCountFrequency (%)
0214846
35.6%
1108787
18.0%
269968
 
11.6%
345454
 
7.5%
438119
 
6.3%
528119
 
4.7%
825768
 
4.3%
725190
 
4.2%
624104
 
4.0%
923141
 
3.8%
ValueCountFrequency (%)
e132
49.1%
s66
24.5%
t66
24.5%
b5
 
1.9%
ValueCountFrequency (%)
_341
100.0%
ValueCountFrequency (%)
370020
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin2255178
69.8%
Common973857
30.2%

Most frequent character per script

ValueCountFrequency (%)
O235561
 
10.4%
A235343
 
10.4%
E214964
 
9.5%
D163008
 
7.2%
I157787
 
7.0%
S133209
 
5.9%
R124881
 
5.5%
M107382
 
4.8%
C103538
 
4.6%
T102248
 
4.5%
Other values (22)677257
30.0%
ValueCountFrequency (%)
370020
38.0%
0214846
22.1%
1108787
 
11.2%
269968
 
7.2%
345454
 
4.7%
438119
 
3.9%
528119
 
2.9%
825768
 
2.6%
725190
 
2.6%
624104
 
2.5%
Other values (2)23482
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII3228986
> 99.9%
None49
 
< 0.1%

Most frequent character per block

ValueCountFrequency (%)
370020
 
11.5%
O235561
 
7.3%
A235343
 
7.3%
E214964
 
6.7%
0214846
 
6.7%
D163008
 
5.0%
I157787
 
4.9%
S133209
 
4.1%
R124881
 
3.9%
1108787
 
3.4%
Other values (32)1270580
39.3%
ValueCountFrequency (%)
Ç45
91.8%
Ã4
 
8.2%

Número do programa
Real number (ℝ≥0)

HIGH CORRELATION
MISSING
SKEWED
ZEROS

Distinct986
Distinct (%)0.7%
Missing74964
Missing (%)33.4%
Infinite0
Infinite (%)0.0%
Mean75.27669321
Minimum0
Maximum17114
Zeros7188
Zeros (%)3.2%
Negative0
Negative (%)0.0%
Memory size1.7 MiB

Quantile statistics

Minimum0
5-th percentile1
Q13
median11
Q339
95-th percentile458
Maximum17114
Range17114
Interquartile range (IQR)36

Descriptive statistics

Standard deviation390.3143839
Coefficient of variation (CV)5.18506283
Kurtosis1547.94539
Mean75.27669321
Median Absolute Deviation (MAD)10
Skewness36.03570618
Sum11240090
Variance152345.3183
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
113812
 
6.2%
29356
 
4.2%
37431
 
3.3%
07188
 
3.2%
46987
 
3.1%
56246
 
2.8%
65124
 
2.3%
74480
 
2.0%
84105
 
1.8%
94091
 
1.8%
Other values (976)80497
35.9%
(Missing)74964
33.4%
ValueCountFrequency (%)
07188
3.2%
113812
6.2%
29356
4.2%
37431
3.3%
46987
3.1%
ValueCountFrequency (%)
1711421
< 0.1%
1710121
< 0.1%
1708221
< 0.1%
120194
 
< 0.1%
11603
 
< 0.1%

Data de registro
Unsupported

REJECTED
UNSUPPORTED

Missing0
Missing (%)0.0%
Memory size1.7 MiB

Data de publicação
Categorical

HIGH CARDINALITY
MISSING

Distinct804
Distinct (%)0.4%
Missing11580
Missing (%)5.2%
Memory size1.7 MiB
13-05-2015
30120 
16-03-2016
 
14335
04-05-2015
 
8662
08-06-2015
 
7737
14-05-2015
 
5401
Other values (799)
146446 

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters2127010
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4 ?
Unique (%)< 0.1%

Sample

1st row16-12-2014
2nd row02-01-2014
3rd row24-06-2014
4th row24-06-2014
5th row10-06-2014
ValueCountFrequency (%)
13-05-201530120
 
13.4%
16-03-201614335
 
6.4%
04-05-20158662
 
3.9%
08-06-20157737
 
3.4%
14-05-20155401
 
2.4%
19-07-20185187
 
2.3%
24-07-20185021
 
2.2%
23-03-20174542
 
2.0%
18-07-20183901
 
1.7%
10-03-20163547
 
1.6%
Other values (794)124248
55.4%
(Missing)11580
 
5.2%
Histogram of lengths of the category
ValueCountFrequency (%)
13-05-201530120
 
14.2%
16-03-201614335
 
6.7%
04-05-20158662
 
4.1%
08-06-20157737
 
3.6%
14-05-20155401
 
2.5%
19-07-20185187
 
2.4%
24-07-20185021
 
2.4%
23-03-20174542
 
2.1%
18-07-20183901
 
1.8%
10-03-20163547
 
1.7%
Other values (794)124248
58.4%

Most occurring characters

ValueCountFrequency (%)
0478790
22.5%
-425402
20.0%
1359136
16.9%
2280248
13.2%
5154073
 
7.2%
6109838
 
5.2%
392615
 
4.4%
877334
 
3.6%
462733
 
2.9%
748845
 
2.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1701608
80.0%
Dash Punctuation425402
 
20.0%

Most frequent character per category

ValueCountFrequency (%)
0478790
28.1%
1359136
21.1%
2280248
16.5%
5154073
 
9.1%
6109838
 
6.5%
392615
 
5.4%
877334
 
4.5%
462733
 
3.7%
748845
 
2.9%
937996
 
2.2%
ValueCountFrequency (%)
-425402
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common2127010
100.0%

Most frequent character per script

ValueCountFrequency (%)
0478790
22.5%
-425402
20.0%
1359136
16.9%
2280248
13.2%
5154073
 
7.2%
6109838
 
5.2%
392615
 
4.4%
877334
 
3.6%
462733
 
2.9%
748845
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII2127010
100.0%

Most frequent character per block

ValueCountFrequency (%)
0478790
22.5%
-425402
20.0%
1359136
16.9%
2280248
13.2%
5154073
 
7.2%
6109838
 
5.2%
392615
 
4.4%
877334
 
3.6%
462733
 
2.9%
748845
 
2.3%

Duração
Unsupported

REJECTED
UNSUPPORTED

Missing0
Missing (%)0.0%
Memory size1.7 MiB

Status de visualização
Categorical

MISSING

Distinct2
Distinct (%)< 0.1%
Missing179340
Missing (%)80.0%
Memory size1.7 MiB
Liberado
29653 
Liberação pendente
15288 

Length

Max length18
Median length8
Mean length11.40179346
Min length8

Characters and Unicode

Total characters512408
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowLiberação pendente
2nd rowLiberação pendente
3rd rowLiberação pendente
4th rowLiberação pendente
5th rowLiberado
ValueCountFrequency (%)
Liberado29653
 
13.2%
Liberação pendente15288
 
6.8%
(Missing)179340
80.0%
Histogram of lengths of the category
ValueCountFrequency (%)
liberado29653
49.2%
liberação15288
25.4%
pendente15288
25.4%

Most occurring characters

ValueCountFrequency (%)
e90805
17.7%
L44941
8.8%
i44941
8.8%
b44941
8.8%
r44941
8.8%
a44941
8.8%
o44941
8.8%
d44941
8.8%
n30576
 
6.0%
ç15288
 
3.0%
Other values (4)61152
11.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter452179
88.2%
Uppercase Letter44941
 
8.8%
Space Separator15288
 
3.0%

Most frequent character per category

ValueCountFrequency (%)
e90805
20.1%
i44941
9.9%
b44941
9.9%
r44941
9.9%
a44941
9.9%
o44941
9.9%
d44941
9.9%
n30576
 
6.8%
ç15288
 
3.4%
ã15288
 
3.4%
Other values (2)30576
 
6.8%
ValueCountFrequency (%)
L44941
100.0%
ValueCountFrequency (%)
15288
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin497120
97.0%
Common15288
 
3.0%

Most frequent character per script

ValueCountFrequency (%)
e90805
18.3%
L44941
9.0%
i44941
9.0%
b44941
9.0%
r44941
9.0%
a44941
9.0%
o44941
9.0%
d44941
9.0%
n30576
 
6.2%
ç15288
 
3.1%
Other values (3)45864
9.2%
ValueCountFrequency (%)
15288
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII481832
94.0%
None30576
 
6.0%

Most frequent character per block

ValueCountFrequency (%)
e90805
18.8%
L44941
9.3%
i44941
9.3%
b44941
9.3%
r44941
9.3%
a44941
9.3%
o44941
9.3%
d44941
9.3%
n30576
 
6.3%
15288
 
3.2%
Other values (2)30576
 
6.3%
ValueCountFrequency (%)
ç15288
50.0%
ã15288
50.0%

Série
Categorical

HIGH CARDINALITY
MISSING

Distinct264
Distinct (%)0.1%
Missing4899
Missing (%)2.2%
Memory size1.7 MiB
HORA DO ENEM
28081 
SALTO PARA O FUTURO - ACERVO
 
8970
Especiais Diversos
 
6714
KIWI
 
5370
E-Notícias
 
5192
Other values (259)
165055 

Length

Max length67
Median length18
Mean length19.16565625
Min length3

Characters and Unicode

Total characters4204600
Distinct characters93
Distinct categories10 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row500 ANOS — O BRASIL COLÔNIA NA TV
2nd row500 ANOS — O BRASIL COLÔNIA NA TV
3rd row500 ANOS — O BRASIL COLÔNIA NA TV
4th row500 ANOS — O BRASIL COLÔNIA NA TV
5th row500 ANOS — O BRASIL COLÔNIA NA TV
ValueCountFrequency (%)
HORA DO ENEM28081
 
12.5%
SALTO PARA O FUTURO - ACERVO8970
 
4.0%
Especiais Diversos6714
 
3.0%
KIWI5370
 
2.4%
E-Notícias5192
 
2.3%
SALTO PARA O FUTURO4780
 
2.1%
SALA DE PROFESSOR4668
 
2.1%
INVASÃO PLÂNCTON4503
 
2.0%
CONHECENDO MUSEUS4023
 
1.8%
SALA DE PROFESSOR (Libras)3972
 
1.8%
Other values (254)143109
63.8%
(Missing)4899
 
2.2%
Histogram of lengths of the category
ValueCountFrequency (%)
do44915
 
6.0%
enem30688
 
4.1%
hora30583
 
4.1%
o23369
 
3.1%
de21334
 
2.8%
escola20560
 
2.7%
17853
 
2.4%
para15606
 
2.1%
futuro14545
 
1.9%
salto13750
 
1.8%
Other values (418)518781
69.0%

Most occurring characters

ValueCountFrequency (%)
533835
 
12.7%
A408525
 
9.7%
O350623
 
8.3%
E299145
 
7.1%
S236809
 
5.6%
R206523
 
4.9%
N166255
 
4.0%
I153145
 
3.6%
D152124
 
3.6%
T141852
 
3.4%
Other values (83)1555764
37.0%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter2993614
71.2%
Lowercase Letter575179
 
13.7%
Space Separator533835
 
12.7%
Dash Punctuation38674
 
0.9%
Other Punctuation20220
 
0.5%
Decimal Number17890
 
0.4%
Open Punctuation11346
 
0.3%
Close Punctuation11346
 
0.3%
Other Letter2223
 
0.1%
Currency Symbol273
 
< 0.1%

Most frequent character per category

ValueCountFrequency (%)
A408525
13.6%
O350623
11.7%
E299145
 
10.0%
S236809
 
7.9%
R206523
 
6.9%
N166255
 
5.6%
I153145
 
5.1%
D152124
 
5.1%
T141852
 
4.7%
U126759
 
4.2%
Other values (28)751854
25.1%
ValueCountFrequency (%)
s76968
13.4%
a66219
11.5%
e63495
11.0%
o59198
10.3%
r54087
9.4%
i47654
8.3%
c25107
 
4.4%
d23074
 
4.0%
n20341
 
3.5%
t20091
 
3.5%
Other values (24)118945
20.7%
ValueCountFrequency (%)
26943
38.8%
04754
26.6%
32451
 
13.7%
52343
 
13.1%
11016
 
5.7%
6315
 
1.8%
948
 
0.3%
820
 
0.1%
ValueCountFrequency (%)
,7527
37.2%
?7011
34.7%
*3777
18.7%
:1873
 
9.3%
!32
 
0.2%
ValueCountFrequency (%)
-28505
73.7%
9159
 
23.7%
1010
 
2.6%
ValueCountFrequency (%)
533835
100.0%
ValueCountFrequency (%)
ª2223
100.0%
ValueCountFrequency (%)
(11346
100.0%
ValueCountFrequency (%)
)11346
100.0%
ValueCountFrequency (%)
$273
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin3571016
84.9%
Common633584
 
15.1%

Most frequent character per script

ValueCountFrequency (%)
A408525
 
11.4%
O350623
 
9.8%
E299145
 
8.4%
S236809
 
6.6%
R206523
 
5.8%
N166255
 
4.7%
I153145
 
4.3%
D152124
 
4.3%
T141852
 
4.0%
U126759
 
3.5%
Other values (63)1329256
37.2%
ValueCountFrequency (%)
533835
84.3%
-28505
 
4.5%
(11346
 
1.8%
)11346
 
1.8%
9159
 
1.4%
,7527
 
1.2%
?7011
 
1.1%
26943
 
1.1%
04754
 
0.8%
*3777
 
0.6%
Other values (10)9381
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII4091652
97.3%
None102779
 
2.4%
Punctuation10169
 
0.2%

Most frequent character per block

ValueCountFrequency (%)
533835
 
13.0%
A408525
 
10.0%
O350623
 
8.6%
E299145
 
7.3%
S236809
 
5.8%
R206523
 
5.0%
N166255
 
4.1%
I153145
 
3.7%
D152124
 
3.7%
T141852
 
3.5%
Other values (59)1442816
35.3%
ValueCountFrequency (%)
9159
90.1%
1010
 
9.9%
ValueCountFrequency (%)
Ã16693
16.2%
Á11631
11.3%
Ç11014
10.7%
í8606
8.4%
Ó8349
8.1%
Í6751
 
6.6%
Ô5350
 
5.2%
Ê5329
 
5.2%
Â4503
 
4.4%
É4495
 
4.4%
Other values (12)20058
19.5%

Número do episódio
Real number (ℝ≥0)

HIGH CORRELATION
MISSING
SKEWED
ZEROS

Distinct981
Distinct (%)0.6%
Missing73157
Missing (%)32.6%
Infinite0
Infinite (%)0.0%
Mean74.33671687
Minimum0
Maximum17114
Zeros7278
Zeros (%)3.2%
Negative0
Negative (%)0.0%
Memory size1.7 MiB

Quantile statistics

Minimum0
5-th percentile1
Q13
median11
Q339
95-th percentile456
Maximum17114
Range17114
Interquartile range (IQR)36

Descriptive statistics

Standard deviation387.9528034
Coefficient of variation (CV)5.218858455
Kurtosis1567.404299
Mean74.33671687
Median Absolute Deviation (MAD)9
Skewness36.26530462
Sum11234062
Variance150507.3776
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
114127
 
6.3%
29512
 
4.2%
37665
 
3.4%
07278
 
3.2%
47020
 
3.1%
56402
 
2.9%
65202
 
2.3%
74636
 
2.1%
84204
 
1.9%
94160
 
1.9%
Other values (971)80918
36.1%
(Missing)73157
32.6%
ValueCountFrequency (%)
07278
3.2%
114127
6.3%
29512
4.2%
37665
3.4%
47020
3.1%
ValueCountFrequency (%)
1711421
< 0.1%
1710121
< 0.1%
1708221
< 0.1%
120194
 
< 0.1%
11603
 
< 0.1%

Áreas temáticas
Categorical

HIGH CARDINALITY
MISSING

Distinct4314
Distinct (%)2.1%
Missing20328
Missing (%)9.1%
Memory size1.7 MiB
Escola-Educação
 
14295
Matemática
 
9182
Língua Inglesa
 
6356
Meio Ambiente
 
4515
Língua Portuguesa
 
4207
Other values (4309)
165398 

Length

Max length172
Median length26
Mean length26.45205023
Min length5

Characters and Unicode

Total characters5394975
Distinct characters48
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1334 ?
Unique (%)0.7%

Sample

1st rowDiversidade Cultural, Física
2nd rowDiversidade Cultural
3rd rowMúsica
4th rowMúsica
5th rowArtes, Matemática, Saúde
ValueCountFrequency (%)
Escola-Educação14295
 
6.4%
Matemática9182
 
4.1%
Língua Inglesa6356
 
2.8%
Meio Ambiente4515
 
2.0%
Língua Portuguesa4207
 
1.9%
Ciências, Meio Ambiente3530
 
1.6%
História3009
 
1.3%
Não Informado2916
 
1.3%
Redação2641
 
1.2%
Língua Portuguesa, Literatura2572
 
1.1%
Other values (4304)150730
67.2%
(Missing)20328
 
9.1%
Histogram of lengths of the category
ValueCountFrequency (%)
história46284
 
8.0%
escola-educação39203
 
6.7%
artes38104
 
6.6%
língua37342
 
6.4%
ciências33672
 
5.8%
matemática28891
 
5.0%
cultural27970
 
4.8%
diversidade27970
 
4.8%
portuguesa25671
 
4.4%
ambiente24774
 
4.3%
Other values (24)251797
43.3%

Most occurring characters

ValueCountFrequency (%)
a603283
 
11.2%
i501961
 
9.3%
377725
 
7.0%
o314117
 
5.8%
e304752
 
5.6%
t289714
 
5.4%
s274420
 
5.1%
,262612
 
4.9%
c256342
 
4.8%
r239240
 
4.4%
Other values (38)1970809
36.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter4094566
75.9%
Uppercase Letter620869
 
11.5%
Space Separator377725
 
7.0%
Other Punctuation262612
 
4.9%
Dash Punctuation39203
 
0.7%

Most frequent character per category

ValueCountFrequency (%)
a603283
14.7%
i501961
12.3%
o314117
 
7.7%
e304752
 
7.4%
t289714
 
7.1%
s274420
 
6.7%
c256342
 
6.3%
r239240
 
5.8%
u233265
 
5.7%
l165198
 
4.0%
Other values (17)912274
22.3%
ValueCountFrequency (%)
E108708
17.5%
A68551
11.0%
M65857
10.6%
C61642
9.9%
L55305
8.9%
H46284
7.5%
S38053
 
6.1%
F33160
 
5.3%
D27970
 
4.5%
P25671
 
4.1%
Other values (8)89668
14.4%
ValueCountFrequency (%)
377725
100.0%
ValueCountFrequency (%)
,262612
100.0%
ValueCountFrequency (%)
-39203
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin4715435
87.4%
Common679540
 
12.6%

Most frequent character per script

ValueCountFrequency (%)
a603283
 
12.8%
i501961
 
10.6%
o314117
 
6.7%
e304752
 
6.5%
t289714
 
6.1%
s274420
 
5.8%
c256342
 
5.4%
r239240
 
5.1%
u233265
 
4.9%
l165198
 
3.5%
Other values (35)1533143
32.5%
ValueCountFrequency (%)
377725
55.6%
,262612
38.6%
-39203
 
5.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII5033624
93.3%
None361351
 
6.7%

Most frequent character per block

ValueCountFrequency (%)
a603283
 
12.0%
i501961
 
10.0%
377725
 
7.5%
o314117
 
6.2%
e304752
 
6.1%
t289714
 
5.8%
s274420
 
5.5%
,262612
 
5.2%
c256342
 
5.1%
r239240
 
4.8%
Other values (29)1609458
32.0%
ValueCountFrequency (%)
ã67454
18.7%
í65053
18.0%
ç64538
17.9%
ó46284
12.8%
ê33672
9.3%
á32949
9.1%
ú27929
7.7%
É23460
 
6.5%
Á12
 
< 0.1%

Etapas de ensino
Categorical

HIGH CARDINALITY
MISSING

Distinct445
Distinct (%)0.2%
Missing12855
Missing (%)5.7%
Memory size1.7 MiB
Geral
58890 
Ensino Médio
52143 
Ensino Fundamental II
32886 
Educação Infantil
20164 
Ensino Fundamental I
15010 
Other values (440)
32333 

Length

Max length133
Median length12
Mean length15.34431432
Min length5

Characters and Unicode

Total characters3244187
Distinct characters31
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique153 ?
Unique (%)0.1%

Sample

1st rowGeral
2nd rowGeral
3rd rowEnsino Fundamental II
4th rowGeral
5th rowEnsino Fundamental I
ValueCountFrequency (%)
Geral58890
26.3%
Ensino Médio52143
23.2%
Ensino Fundamental II32886
14.7%
Educação Infantil20164
 
9.0%
Ensino Fundamental I15010
 
6.7%
Ciclo de Alfabetização12480
 
5.6%
Superior3766
 
1.7%
Educação Infantil, Ensino Fundamental I3243
 
1.4%
Ensino Fundamental I, Educação Infantil1594
 
0.7%
Ensino Médio, Ensino Fundamental II1103
 
0.5%
Other values (435)10147
 
4.5%
(Missing)12855
 
5.7%
Histogram of lengths of the category
ValueCountFrequency (%)
ensino124329
25.9%
fundamental64256
13.4%
geral62838
13.1%
médio58563
12.2%
ii39835
 
8.3%
educação27768
 
5.8%
infantil27768
 
5.8%
i24421
 
5.1%
de14624
 
3.0%
ciclo14612
 
3.0%
Other values (4)21258
 
4.4%

Most occurring characters

ValueCountFrequency (%)
n434204
13.4%
a276134
 
8.5%
268846
 
8.3%
i246518
 
7.6%
o246518
 
7.6%
l184086
 
5.7%
d165211
 
5.1%
e161478
 
5.0%
E152097
 
4.7%
I131859
 
4.1%
Other values (21)977236
30.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter2446613
75.4%
Uppercase Letter505471
 
15.6%
Space Separator268846
 
8.3%
Other Punctuation23257
 
0.7%

Most frequent character per category

ValueCountFrequency (%)
n434204
17.7%
a276134
11.3%
i246518
10.1%
o246518
10.1%
l184086
7.5%
d165211
 
6.8%
e161478
 
6.6%
s124341
 
5.1%
t106648
 
4.4%
u97160
 
4.0%
Other values (10)404315
16.5%
ValueCountFrequency (%)
E152097
30.1%
I131859
26.1%
F64256
12.7%
G62838
12.4%
M58563
 
11.6%
C14612
 
2.9%
A14612
 
2.9%
S5136
 
1.0%
T1498
 
0.3%
ValueCountFrequency (%)
268846
100.0%
ValueCountFrequency (%)
,23257
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin2952084
91.0%
Common292103
 
9.0%

Most frequent character per script

ValueCountFrequency (%)
n434204
14.7%
a276134
 
9.4%
i246518
 
8.4%
o246518
 
8.4%
l184086
 
6.2%
d165211
 
5.6%
e161478
 
5.5%
E152097
 
5.2%
I131859
 
4.5%
s124341
 
4.2%
Other values (19)829638
28.1%
ValueCountFrequency (%)
268846
92.0%
,23257
 
8.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII3099366
95.5%
None144821
 
4.5%

Most frequent character per block

ValueCountFrequency (%)
n434204
14.0%
a276134
 
8.9%
268846
 
8.7%
i246518
 
8.0%
o246518
 
8.0%
l184086
 
5.9%
d165211
 
5.3%
e161478
 
5.2%
E152097
 
4.9%
I131859
 
4.3%
Other values (18)832415
26.9%
ValueCountFrequency (%)
é60061
41.5%
ç42380
29.3%
ã42380
29.3%

Públicos-alvo
Categorical

MISSING

Distinct16
Distinct (%)< 0.1%
Missing17325
Missing (%)7.7%
Memory size1.7 MiB
Público em geral
85608 
Aluno
72229 
Professor
32285 
Público em geral, Aluno, Professor
 
2303
Aluno, Público em geral, Professor
 
2038
Other values (11)
12493 

Length

Max length34
Median length9
Mean length12.23444597
Min length5

Characters and Unicode

Total characters2531992
Distinct characters21
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPúblico em geral
2nd rowPúblico em geral
3rd rowPúblico em geral
4th rowPúblico em geral
5th rowAluno
ValueCountFrequency (%)
Público em geral85608
38.2%
Aluno72229
32.2%
Professor32285
 
14.4%
Público em geral, Aluno, Professor2303
 
1.0%
Aluno, Público em geral, Professor2038
 
0.9%
Aluno, Público em geral1948
 
0.9%
Professor, Público em geral, Aluno1883
 
0.8%
Professor, Aluno, Público em geral1859
 
0.8%
Público em geral, Professor, Aluno1857
 
0.8%
Público em geral, Aluno1755
 
0.8%
Other values (6)3191
 
1.4%
(Missing)17325
 
7.7%
Histogram of lengths of the category
ValueCountFrequency (%)
geral101697
23.2%
em101697
23.2%
público101697
23.2%
aluno87962
20.1%
professor45404
10.4%
públicos-alvo12
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
l291380
 
11.5%
o280491
 
11.1%
e248798
 
9.8%
231513
 
9.1%
r192505
 
7.6%
P147113
 
5.8%
ú101709
 
4.0%
b101709
 
4.0%
i101709
 
4.0%
c101709
 
4.0%
Other values (11)733356
29.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter2037273
80.5%
Uppercase Letter235075
 
9.3%
Space Separator231513
 
9.1%
Other Punctuation28119
 
1.1%
Dash Punctuation12
 
< 0.1%

Most frequent character per category

ValueCountFrequency (%)
l291380
14.3%
o280491
13.8%
e248798
12.2%
r192505
9.4%
ú101709
 
5.0%
b101709
 
5.0%
i101709
 
5.0%
c101709
 
5.0%
a101709
 
5.0%
m101697
 
5.0%
Other values (6)413857
20.3%
ValueCountFrequency (%)
P147113
62.6%
A87962
37.4%
ValueCountFrequency (%)
231513
100.0%
ValueCountFrequency (%)
,28119
100.0%
ValueCountFrequency (%)
-12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin2272348
89.7%
Common259644
 
10.3%

Most frequent character per script

ValueCountFrequency (%)
l291380
12.8%
o280491
12.3%
e248798
10.9%
r192505
 
8.5%
P147113
 
6.5%
ú101709
 
4.5%
b101709
 
4.5%
i101709
 
4.5%
c101709
 
4.5%
a101709
 
4.5%
Other values (8)603516
26.6%
ValueCountFrequency (%)
231513
89.2%
,28119
 
10.8%
-12
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII2430283
96.0%
None101709
 
4.0%

Most frequent character per block

ValueCountFrequency (%)
l291380
12.0%
o280491
11.5%
e248798
 
10.2%
231513
 
9.5%
r192505
 
7.9%
P147113
 
6.1%
b101709
 
4.2%
i101709
 
4.2%
c101709
 
4.2%
a101709
 
4.2%
Other values (10)631647
26.0%
ValueCountFrequency (%)
ú101709
100.0%

MECFlix
Categorical

MISSING

Distinct2
Distinct (%)< 0.1%
Missing65028
Missing (%)29.0%
Memory size1.7 MiB
NÃO
88075 
SIM
71178 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters477759
Distinct characters6
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNÃO
2nd rowNÃO
3rd rowNÃO
4th rowNÃO
5th rowNÃO
ValueCountFrequency (%)
NÃO88075
39.3%
SIM71178
31.7%
(Missing)65028
29.0%
Histogram of lengths of the category
ValueCountFrequency (%)
não88075
55.3%
sim71178
44.7%

Most occurring characters

ValueCountFrequency (%)
N88075
18.4%
Ã88075
18.4%
O88075
18.4%
S71178
14.9%
I71178
14.9%
M71178
14.9%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter477759
100.0%

Most frequent character per category

ValueCountFrequency (%)
N88075
18.4%
Ã88075
18.4%
O88075
18.4%
S71178
14.9%
I71178
14.9%
M71178
14.9%

Most occurring scripts

ValueCountFrequency (%)
Latin477759
100.0%

Most frequent character per script

ValueCountFrequency (%)
N88075
18.4%
Ã88075
18.4%
O88075
18.4%
S71178
14.9%
I71178
14.9%
M71178
14.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII389684
81.6%
None88075
 
18.4%

Most frequent character per block

ValueCountFrequency (%)
N88075
22.6%
O88075
22.6%
S71178
18.3%
I71178
18.3%
M71178
18.3%
ValueCountFrequency (%)
Ã88075
100.0%

MEC RED
Categorical

MISSING

Distinct7
Distinct (%)< 0.1%
Missing129300
Missing (%)57.7%
Memory size1.7 MiB
NÃO
51176 
SIM
40512 
WAITING_TO_SEND
 
1680
WAITING_TO_UNPUBLISH
 
1065
ERROR_SEND
 
505
Other values (2)
 
43

Length

Max length20
Median length3
Mean length3.446310315
Min length3

Characters and Unicode

Total characters327334
Distinct characters19
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNÃO
2nd rowNÃO
3rd rowNÃO
4th rowNÃO
5th rowNÃO
ValueCountFrequency (%)
NÃO51176
 
22.8%
SIM40512
 
18.1%
WAITING_TO_SEND1680
 
0.7%
WAITING_TO_UNPUBLISH1065
 
0.5%
ERROR_SEND505
 
0.2%
ERROR_UNPUBLISHING25
 
< 0.1%
WAITING_PUBLISH18
 
< 0.1%
(Missing)129300
57.7%
Histogram of lengths of the category
ValueCountFrequency (%)
não51176
53.9%
sim40512
42.7%
waiting_to_send1680
 
1.8%
waiting_to_unpublish1065
 
1.1%
error_send505
 
0.5%
error_unpublishing25
 
< 0.1%
waiting_publish18
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
N57239
17.5%
O54451
16.6%
Ã51176
15.6%
I47171
14.4%
S43805
13.4%
M40512
12.4%
_6038
 
1.8%
T5508
 
1.7%
G2788
 
0.9%
W2763
 
0.8%
Other values (9)15883
 
4.9%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter321296
98.2%
Connector Punctuation6038
 
1.8%

Most frequent character per category

ValueCountFrequency (%)
N57239
17.8%
O54451
16.9%
Ã51176
15.9%
I47171
14.7%
S43805
13.6%
M40512
12.6%
T5508
 
1.7%
G2788
 
0.9%
W2763
 
0.9%
A2763
 
0.9%
Other values (8)13120
 
4.1%
ValueCountFrequency (%)
_6038
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin321296
98.2%
Common6038
 
1.8%

Most frequent character per script

ValueCountFrequency (%)
N57239
17.8%
O54451
16.9%
Ã51176
15.9%
I47171
14.7%
S43805
13.6%
M40512
12.6%
T5508
 
1.7%
G2788
 
0.9%
W2763
 
0.9%
A2763
 
0.9%
Other values (8)13120
 
4.1%
ValueCountFrequency (%)
_6038
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII276158
84.4%
None51176
 
15.6%

Most frequent character per block

ValueCountFrequency (%)
N57239
20.7%
O54451
19.7%
I47171
17.1%
S43805
15.9%
M40512
14.7%
_6038
 
2.2%
T5508
 
2.0%
G2788
 
1.0%
W2763
 
1.0%
A2763
 
1.0%
Other values (8)13120
 
4.8%
ValueCountFrequency (%)
Ã51176
100.0%

Disp. TV Escola Crianças
Categorical

MISSING

Distinct2
Distinct (%)< 0.1%
Missing129300
Missing (%)57.7%
Memory size1.7 MiB
NÃO
82124 
SIM
12857 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters284943
Distinct characters6
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNÃO
2nd rowNÃO
3rd rowNÃO
4th rowNÃO
5th rowNÃO
ValueCountFrequency (%)
NÃO82124
36.6%
SIM12857
 
5.7%
(Missing)129300
57.7%
Histogram of lengths of the category
ValueCountFrequency (%)
não82124
86.5%
sim12857
 
13.5%

Most occurring characters

ValueCountFrequency (%)
N82124
28.8%
Ã82124
28.8%
O82124
28.8%
S12857
 
4.5%
I12857
 
4.5%
M12857
 
4.5%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter284943
100.0%

Most frequent character per category

ValueCountFrequency (%)
N82124
28.8%
Ã82124
28.8%
O82124
28.8%
S12857
 
4.5%
I12857
 
4.5%
M12857
 
4.5%

Most occurring scripts

ValueCountFrequency (%)
Latin284943
100.0%

Most frequent character per script

ValueCountFrequency (%)
N82124
28.8%
Ã82124
28.8%
O82124
28.8%
S12857
 
4.5%
I12857
 
4.5%
M12857
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII202819
71.2%
None82124
28.8%

Most frequent character per block

ValueCountFrequency (%)
N82124
40.5%
O82124
40.5%
S12857
 
6.3%
I12857
 
6.3%
M12857
 
6.3%
ValueCountFrequency (%)
Ã82124
100.0%

Inédito
Categorical

MISSING

Distinct2
Distinct (%)< 0.1%
Missing65028
Missing (%)29.0%
Memory size1.7 MiB
NÃO
156286 
SIM
 
2967

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters477759
Distinct characters6
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNÃO
2nd rowNÃO
3rd rowNÃO
4th rowNÃO
5th rowNÃO
ValueCountFrequency (%)
NÃO156286
69.7%
SIM2967
 
1.3%
(Missing)65028
29.0%
Histogram of lengths of the category
ValueCountFrequency (%)
não156286
98.1%
sim2967
 
1.9%

Most occurring characters

ValueCountFrequency (%)
N156286
32.7%
Ã156286
32.7%
O156286
32.7%
S2967
 
0.6%
I2967
 
0.6%
M2967
 
0.6%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter477759
100.0%

Most frequent character per category

ValueCountFrequency (%)
N156286
32.7%
Ã156286
32.7%
O156286
32.7%
S2967
 
0.6%
I2967
 
0.6%
M2967
 
0.6%

Most occurring scripts

ValueCountFrequency (%)
Latin477759
100.0%

Most frequent character per script

ValueCountFrequency (%)
N156286
32.7%
Ã156286
32.7%
O156286
32.7%
S2967
 
0.6%
I2967
 
0.6%
M2967
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII321473
67.3%
None156286
32.7%

Most frequent character per block

ValueCountFrequency (%)
N156286
48.6%
O156286
48.6%
S2967
 
0.9%
I2967
 
0.9%
M2967
 
0.9%
ValueCountFrequency (%)
Ã156286
100.0%

LIBRAS
Categorical

MISSING

Distinct2
Distinct (%)< 0.1%
Missing65028
Missing (%)29.0%
Memory size1.7 MiB
NÃO
148650 
SIM
 
10603

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters477759
Distinct characters6
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNÃO
2nd rowNÃO
3rd rowNÃO
4th rowNÃO
5th rowSIM
ValueCountFrequency (%)
NÃO148650
66.3%
SIM10603
 
4.7%
(Missing)65028
29.0%
Histogram of lengths of the category
ValueCountFrequency (%)
não148650
93.3%
sim10603
 
6.7%

Most occurring characters

ValueCountFrequency (%)
N148650
31.1%
Ã148650
31.1%
O148650
31.1%
S10603
 
2.2%
I10603
 
2.2%
M10603
 
2.2%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter477759
100.0%

Most frequent character per category

ValueCountFrequency (%)
N148650
31.1%
Ã148650
31.1%
O148650
31.1%
S10603
 
2.2%
I10603
 
2.2%
M10603
 
2.2%

Most occurring scripts

ValueCountFrequency (%)
Latin477759
100.0%

Most frequent character per script

ValueCountFrequency (%)
N148650
31.1%
Ã148650
31.1%
O148650
31.1%
S10603
 
2.2%
I10603
 
2.2%
M10603
 
2.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII329109
68.9%
None148650
31.1%

Most frequent character per block

ValueCountFrequency (%)
N148650
45.2%
O148650
45.2%
S10603
 
3.2%
I10603
 
3.2%
M10603
 
3.2%
ValueCountFrequency (%)
Ã148650
100.0%

Função do vídeo
Categorical

MISSING

Distinct11
Distinct (%)< 0.1%
Missing68190
Missing (%)30.4%
Memory size1.7 MiB
Programa
147117 
Interprograma
 
5074
Chamada
 
2086
Filler
 
708
Making of
 
521
Other values (6)
 
585

Length

Max length23
Median length8
Mean length8.167389536
Min length6

Characters and Unicode

Total characters1274856
Distinct characters25
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPrograma
2nd rowPrograma
3rd rowPrograma
4th rowPrograma
5th rowPrograma
ValueCountFrequency (%)
Programa147117
65.6%
Interprograma5074
 
2.3%
Chamada2086
 
0.9%
Filler708
 
0.3%
Making of521
 
0.2%
Programa, Interprograma172
 
0.1%
Vinheta135
 
0.1%
Trailer123
 
0.1%
Interprograma, Programa91
 
< 0.1%
Making Of52
 
< 0.1%
(Missing)68190
30.4%
Histogram of lengths of the category
ValueCountFrequency (%)
programa147380
93.9%
interprograma5337
 
3.4%
chamada2086
 
1.3%
filler708
 
0.5%
making573
 
0.4%
of573
 
0.4%
vinheta135
 
0.1%
trailer123
 
0.1%
trailler12
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
a312535
24.5%
r311749
24.5%
m154803
12.1%
g153290
12.0%
o153238
12.0%
P147380
11.6%
e6315
 
0.5%
n6045
 
0.5%
t5472
 
0.4%
I5337
 
0.4%
Other values (15)18692
 
1.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter1117351
87.6%
Uppercase Letter156406
 
12.3%
Space Separator836
 
0.1%
Other Punctuation263
 
< 0.1%

Most frequent character per category

ValueCountFrequency (%)
a312535
28.0%
r311749
27.9%
m154803
13.9%
g153290
13.7%
o153238
13.7%
e6315
 
0.6%
n6045
 
0.5%
t5472
 
0.5%
p5337
 
0.5%
h2221
 
0.2%
Other values (5)6346
 
0.6%
ValueCountFrequency (%)
P147380
94.2%
I5337
 
3.4%
C2086
 
1.3%
F708
 
0.5%
M573
 
0.4%
V135
 
0.1%
T135
 
0.1%
O52
 
< 0.1%
ValueCountFrequency (%)
836
100.0%
ValueCountFrequency (%)
,263
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin1273757
99.9%
Common1099
 
0.1%

Most frequent character per script

ValueCountFrequency (%)
a312535
24.5%
r311749
24.5%
m154803
12.2%
g153290
12.0%
o153238
12.0%
P147380
11.6%
e6315
 
0.5%
n6045
 
0.5%
t5472
 
0.4%
I5337
 
0.4%
Other values (13)17593
 
1.4%
ValueCountFrequency (%)
836
76.1%
,263
 
23.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII1274856
100.0%

Most frequent character per block

ValueCountFrequency (%)
a312535
24.5%
r311749
24.5%
m154803
12.1%
g153290
12.0%
o153238
12.0%
P147380
11.6%
e6315
 
0.5%
n6045
 
0.5%
t5472
 
0.4%
I5337
 
0.4%
Other values (15)18692
 
1.5%

Versão brasileira
Categorical

MISSING

Distinct2
Distinct (%)< 0.1%
Missing206208
Missing (%)91.9%
Memory size1.7 MiB
Dublado
11528 
Legendado
6545 

Length

Max length9
Median length7
Mean length7.724284845
Min length7

Characters and Unicode

Total characters139601
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowDublado
2nd rowDublado
3rd rowDublado
4th rowDublado
5th rowDublado
ValueCountFrequency (%)
Dublado11528
 
5.1%
Legendado6545
 
2.9%
(Missing)206208
91.9%
Histogram of lengths of the category
ValueCountFrequency (%)
dublado11528
63.8%
legendado6545
36.2%

Most occurring characters

ValueCountFrequency (%)
d24618
17.6%
a18073
12.9%
o18073
12.9%
e13090
9.4%
D11528
8.3%
u11528
8.3%
b11528
8.3%
l11528
8.3%
L6545
 
4.7%
g6545
 
4.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter121528
87.1%
Uppercase Letter18073
 
12.9%

Most frequent character per category

ValueCountFrequency (%)
d24618
20.3%
a18073
14.9%
o18073
14.9%
e13090
10.8%
u11528
9.5%
b11528
9.5%
l11528
9.5%
g6545
 
5.4%
n6545
 
5.4%
ValueCountFrequency (%)
D11528
63.8%
L6545
36.2%

Most occurring scripts

ValueCountFrequency (%)
Latin139601
100.0%

Most frequent character per script

ValueCountFrequency (%)
d24618
17.6%
a18073
12.9%
o18073
12.9%
e13090
9.4%
D11528
8.3%
u11528
8.3%
b11528
8.3%
l11528
8.3%
L6545
 
4.7%
g6545
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII139601
100.0%

Most frequent character per block

ValueCountFrequency (%)
d24618
17.6%
a18073
12.9%
o18073
12.9%
e13090
9.4%
D11528
8.3%
u11528
8.3%
b11528
8.3%
l11528
8.3%
L6545
 
4.7%
g6545
 
4.7%

Classificação indicativa
Categorical

MISSING

Distinct6
Distinct (%)< 0.1%
Missing69772
Missing (%)31.1%
Memory size1.7 MiB
Livre
140613 
Não recomendado para menores de 10 anos
 
9120
Não recomendado para menores de 12 anos
 
3729
Não recomendado para menores de 14 anos
 
606
Não recomendado para menores de 16 anos
 
363

Length

Max length39
Median length5
Mean length8.057841291
Min length5

Characters and Unicode

Total characters1245009
Distinct characters22
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNão recomendado para menores de 14 anos
2nd rowLivre
3rd rowLivre
4th rowLivre
5th rowLivre
ValueCountFrequency (%)
Livre140613
62.7%
Não recomendado para menores de 10 anos9120
 
4.1%
Não recomendado para menores de 12 anos3729
 
1.7%
Não recomendado para menores de 14 anos606
 
0.3%
Não recomendado para menores de 16 anos363
 
0.2%
Não recomendado para menores de 18 anos78
 
< 0.1%
(Missing)69772
31.1%
Histogram of lengths of the category
ValueCountFrequency (%)
livre140613
59.1%
não13896
 
5.8%
para13896
 
5.8%
anos13896
 
5.8%
recomendado13896
 
5.8%
menores13896
 
5.8%
de13896
 
5.8%
109120
 
3.8%
123729
 
1.6%
14606
 
0.3%
Other values (2)441
 
0.2%

Most occurring characters

ValueCountFrequency (%)
e210093
16.9%
r182301
14.6%
L140613
11.3%
i140613
11.3%
v140613
11.3%
83376
 
6.7%
o69480
 
5.6%
a55584
 
4.5%
n41688
 
3.3%
d41688
 
3.3%
Other values (12)138960
11.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter979332
78.7%
Uppercase Letter154509
 
12.4%
Space Separator83376
 
6.7%
Decimal Number27792
 
2.2%

Most frequent character per category

ValueCountFrequency (%)
e210093
21.5%
r182301
18.6%
i140613
14.4%
v140613
14.4%
o69480
 
7.1%
a55584
 
5.7%
n41688
 
4.3%
d41688
 
4.3%
m27792
 
2.8%
s27792
 
2.8%
Other values (3)41688
 
4.3%
ValueCountFrequency (%)
113896
50.0%
09120
32.8%
23729
 
13.4%
4606
 
2.2%
6363
 
1.3%
878
 
0.3%
ValueCountFrequency (%)
L140613
91.0%
N13896
 
9.0%
ValueCountFrequency (%)
83376
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin1133841
91.1%
Common111168
 
8.9%

Most frequent character per script

ValueCountFrequency (%)
e210093
18.5%
r182301
16.1%
L140613
12.4%
i140613
12.4%
v140613
12.4%
o69480
 
6.1%
a55584
 
4.9%
n41688
 
3.7%
d41688
 
3.7%
m27792
 
2.5%
Other values (5)83376
 
7.4%
ValueCountFrequency (%)
83376
75.0%
113896
 
12.5%
09120
 
8.2%
23729
 
3.4%
4606
 
0.5%
6363
 
0.3%
878
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII1231113
98.9%
None13896
 
1.1%

Most frequent character per block

ValueCountFrequency (%)
e210093
17.1%
r182301
14.8%
L140613
11.4%
i140613
11.4%
v140613
11.4%
83376
 
6.8%
o69480
 
5.6%
a55584
 
4.5%
n41688
 
3.4%
d41688
 
3.4%
Other values (11)125064
10.2%
ValueCountFrequency (%)
ã13896
100.0%

Tipo de produção
Categorical

MISSING

Distinct3
Distinct (%)< 0.1%
Missing69845
Missing (%)31.1%
Memory size1.7 MiB
TV Escola
87863 
Externo
54826 
Coprodução
11747 

Length

Max length10
Median length9
Mean length8.366048072
Min length7

Characters and Unicode

Total characters1292019
Distinct characters20
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowTV Escola
2nd rowCoprodução
3rd rowTV Escola
4th rowTV Escola
5th rowCoprodução
ValueCountFrequency (%)
TV Escola87863
39.2%
Externo54826
24.4%
Coprodução11747
 
5.2%
(Missing)69845
31.1%
Histogram of lengths of the category
ValueCountFrequency (%)
tv87863
36.3%
escola87863
36.3%
externo54826
22.6%
coprodução11747
 
4.8%

Most occurring characters

ValueCountFrequency (%)
o177930
13.8%
E142689
11.0%
T87863
 
6.8%
V87863
 
6.8%
87863
 
6.8%
s87863
 
6.8%
c87863
 
6.8%
l87863
 
6.8%
a87863
 
6.8%
r66573
 
5.2%
Other values (10)289786
22.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter873994
67.6%
Uppercase Letter330162
 
25.6%
Space Separator87863
 
6.8%

Most frequent character per category

ValueCountFrequency (%)
o177930
20.4%
s87863
10.1%
c87863
10.1%
l87863
10.1%
a87863
10.1%
r66573
 
7.6%
x54826
 
6.3%
t54826
 
6.3%
e54826
 
6.3%
n54826
 
6.3%
Other values (5)58735
 
6.7%
ValueCountFrequency (%)
E142689
43.2%
T87863
26.6%
V87863
26.6%
C11747
 
3.6%
ValueCountFrequency (%)
87863
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin1204156
93.2%
Common87863
 
6.8%

Most frequent character per script

ValueCountFrequency (%)
o177930
14.8%
E142689
11.8%
T87863
 
7.3%
V87863
 
7.3%
s87863
 
7.3%
c87863
 
7.3%
l87863
 
7.3%
a87863
 
7.3%
r66573
 
5.5%
x54826
 
4.6%
Other values (9)234960
19.5%
ValueCountFrequency (%)
87863
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII1268525
98.2%
None23494
 
1.8%

Most frequent character per block

ValueCountFrequency (%)
o177930
14.0%
E142689
11.2%
T87863
 
6.9%
V87863
 
6.9%
87863
 
6.9%
s87863
 
6.9%
c87863
 
6.9%
l87863
 
6.9%
a87863
 
6.9%
r66573
 
5.2%
Other values (8)266292
21.0%
ValueCountFrequency (%)
ç11747
50.0%
ã11747
50.0%

Ano de produção
Real number (ℝ≥0)

MISSING
SKEWED

Distinct26
Distinct (%)< 0.1%
Missing72511
Missing (%)32.3%
Infinite0
Infinite (%)0.0%
Mean2020.70004
Minimum1964
Maximum20118
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 MiB

Quantile statistics

Minimum1964
5-th percentile2005
Q12011
median2013
Q32016
95-th percentile2018
Maximum20118
Range18154
Interquartile range (IQR)5

Descriptive statistics

Standard deviation377.328023
Coefficient of variation (CV)0.1867313385
Kurtosis2294.056144
Mean2020.70004
Median Absolute Deviation (MAD)3
Skewness47.91390264
Sum306681645
Variance142376.4369
MonotocityNot monotonic
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
201322566
 
10.1%
201619320
 
8.6%
201416065
 
7.2%
201113044
 
5.8%
201812160
 
5.4%
200910965
 
4.9%
201510481
 
4.7%
201710151
 
4.5%
20129023
 
4.0%
20106786
 
3.0%
Other values (16)21209
 
9.5%
(Missing)72511
32.3%
ValueCountFrequency (%)
196433
 
< 0.1%
198966
 
< 0.1%
1998408
 
0.2%
1999840
0.4%
20001608
0.7%
ValueCountFrequency (%)
2011821
 
< 0.1%
2010645
 
< 0.1%
20194220
 
1.9%
201812160
5.4%
201710151
4.5%

País de origem
Categorical

MISSING

Distinct39
Distinct (%)< 0.1%
Missing73518
Missing (%)32.8%
Memory size1.7 MiB
Brasil
114451 
França
12400 
Coréia do Sul
 
4077
Reino Unido
 
3951
Espanha
 
2388
Other values (34)
13496 

Length

Max length14
Median length6
Mean length6.518170904
Min length3

Characters and Unicode

Total characters982699
Distinct characters48
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowBrasil
2nd rowAlemanha
3rd rowBrasil
4th rowBrasil
5th rowBrasil
ValueCountFrequency (%)
Brasil114451
51.0%
França12400
 
5.5%
Coréia do Sul4077
 
1.8%
Reino Unido3951
 
1.8%
Espanha2388
 
1.1%
Irlanda1560
 
0.7%
Canadá1551
 
0.7%
Colômbia1489
 
0.7%
Estados Unidos1479
 
0.7%
Inglaterra1461
 
0.7%
Other values (29)5956
 
2.7%
(Missing)73518
32.8%
Histogram of lengths of the category
ValueCountFrequency (%)
brasil114451
69.6%
frança12400
 
7.5%
coréia4077
 
2.5%
sul4077
 
2.5%
do4077
 
2.5%
unido3951
 
2.4%
reino3951
 
2.4%
espanha2388
 
1.5%
irlanda1560
 
0.9%
canadá1551
 
0.9%
Other values (35)11918
 
7.2%

Most occurring characters

ValueCountFrequency (%)
a164915
16.8%
r136892
13.9%
i133831
13.6%
l125345
12.8%
s122023
12.4%
B114613
11.7%
n31356
 
3.2%
o23543
 
2.4%
d14205
 
1.4%
13638
 
1.4%
Other values (38)102338
10.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter808719
82.3%
Uppercase Letter160333
 
16.3%
Space Separator13638
 
1.4%
Dash Punctuation9
 
< 0.1%

Most frequent character per category

ValueCountFrequency (%)
a164915
20.4%
r136892
16.9%
i133831
16.5%
l125345
15.5%
s122023
15.1%
n31356
 
3.9%
o23543
 
2.9%
d14205
 
1.8%
ç12400
 
1.5%
e7374
 
0.9%
Other values (17)36835
 
4.6%
ValueCountFrequency (%)
B114613
71.5%
F12400
 
7.7%
C7621
 
4.8%
U5541
 
3.5%
E4473
 
2.8%
S4230
 
2.6%
R3972
 
2.5%
I3540
 
2.2%
A1339
 
0.8%
M1090
 
0.7%
Other values (9)1514
 
0.9%
ValueCountFrequency (%)
13638
100.0%
ValueCountFrequency (%)
-9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin969052
98.6%
Common13647
 
1.4%

Most frequent character per script

ValueCountFrequency (%)
a164915
17.0%
r136892
14.1%
i133831
13.8%
l125345
12.9%
s122023
12.6%
B114613
11.8%
n31356
 
3.2%
o23543
 
2.4%
d14205
 
1.5%
F12400
 
1.3%
Other values (36)89929
9.3%
ValueCountFrequency (%)
13638
99.9%
-9
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII959320
97.6%
None23379
 
2.4%

Most frequent character per block

ValueCountFrequency (%)
a164915
17.2%
r136892
14.3%
i133831
14.0%
l125345
13.1%
s122023
12.7%
B114613
11.9%
n31356
 
3.3%
o23543
 
2.5%
d14205
 
1.5%
13638
 
1.4%
Other values (30)78959
8.2%
ValueCountFrequency (%)
ç12400
53.0%
é5386
23.0%
á2541
 
10.9%
ô1723
 
7.4%
ã1122
 
4.8%
í99
 
0.4%
â87
 
0.4%
Á21
 
0.1%

Data primeira exibição
Categorical

MISSING

Distinct28
Distinct (%)2.6%
Missing223197
Missing (%)99.5%
Memory size1.7 MiB
20-08-2017
90 
13-04-2016
 
45
21-05-2014
 
45
06-11-2015
 
45
05-01-2017
 
45
Other values (23)
814 

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters10840
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row01-01-2014
2nd row02-01-2014
3rd row01-01-2014
4th row02-01-2014
5th row01-01-2014
ValueCountFrequency (%)
20-08-201790
 
< 0.1%
13-04-201645
 
< 0.1%
21-05-201445
 
< 0.1%
06-11-201545
 
< 0.1%
05-01-201745
 
< 0.1%
23-10-201445
 
< 0.1%
06-08-201445
 
< 0.1%
16-10-201445
 
< 0.1%
14-04-201645
 
< 0.1%
30-07-201445
 
< 0.1%
Other values (18)589
 
0.3%
(Missing)223197
99.5%
Histogram of lengths of the category
ValueCountFrequency (%)
20-08-201790
 
8.3%
16-07-201445
 
4.2%
06-08-201445
 
4.2%
12-10-201545
 
4.2%
13-08-201445
 
4.2%
05-11-201545
 
4.2%
05-01-201745
 
4.2%
21-05-201445
 
4.2%
23-10-201445
 
4.2%
30-07-201445
 
4.2%
Other values (18)589
54.3%

Most occurring characters

ValueCountFrequency (%)
12444
22.5%
02393
22.1%
-2168
20.0%
21565
14.4%
4738
 
6.8%
5402
 
3.7%
6307
 
2.8%
8297
 
2.7%
7262
 
2.4%
3255
 
2.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number8672
80.0%
Dash Punctuation2168
 
20.0%

Most frequent character per category

ValueCountFrequency (%)
12444
28.2%
02393
27.6%
21565
18.0%
4738
 
8.5%
5402
 
4.6%
6307
 
3.5%
8297
 
3.4%
7262
 
3.0%
3255
 
2.9%
99
 
0.1%
ValueCountFrequency (%)
-2168
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common10840
100.0%

Most frequent character per script

ValueCountFrequency (%)
12444
22.5%
02393
22.1%
-2168
20.0%
21565
14.4%
4738
 
6.8%
5402
 
3.7%
6307
 
2.8%
8297
 
2.7%
7262
 
2.4%
3255
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII10840
100.0%

Most frequent character per block

ValueCountFrequency (%)
12444
22.5%
02393
22.1%
-2168
20.0%
21565
14.4%
4738
 
6.8%
5402
 
3.7%
6307
 
2.8%
8297
 
2.7%
7262
 
2.4%
3255
 
2.4%

Faixa etária
Categorical

HIGH CARDINALITY
MISSING

Distinct277
Distinct (%)0.2%
Missing87735
Missing (%)39.1%
Memory size1.7 MiB
16-18
35184 
A partir de 18
23054 
10-12
17022 
07-09
15357 
13-15
13753 
Other values (272)
32176 

Length

Max length49
Median length5
Mean length8.627063407
Min length5

Characters and Unicode

Total characters1177991
Distinct characters20
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique83 ?
Unique (%)0.1%

Sample

1st row13-15
2nd row13-15
3rd row13-15
4th row07-09
5th row10-12
ValueCountFrequency (%)
16-1835184
15.7%
A partir de 1823054
 
10.3%
10-1217022
 
7.6%
07-0915357
 
6.8%
13-1513753
 
6.1%
03-069778
 
4.4%
03-06, 07-093815
 
1.7%
A partir de 18, 16-182818
 
1.3%
16-18, A partir de 182629
 
1.2%
13-15, 16-182409
 
1.1%
Other values (267)10727
 
4.8%
(Missing)87735
39.1%
Histogram of lengths of the category
ValueCountFrequency (%)
16-1849231
18.8%
a31775
12.1%
1831775
12.1%
partir31775
12.1%
de31775
12.1%
07-0924281
9.3%
13-1523005
8.8%
10-1222088
8.4%
03-0616064
 
6.1%

Most occurring characters

ValueCountFrequency (%)
1220423
18.7%
-134669
11.4%
125223
10.6%
0102778
 
8.7%
881006
 
6.9%
665295
 
5.5%
r63550
 
5.4%
339069
 
3.3%
A31775
 
2.7%
p31775
 
2.7%
Other values (10)282428
24.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number602226
51.1%
Lowercase Letter254200
21.6%
Dash Punctuation134669
 
11.4%
Space Separator125223
 
10.6%
Uppercase Letter31775
 
2.7%
Other Punctuation29898
 
2.5%

Most frequent character per category

ValueCountFrequency (%)
1220423
36.6%
0102778
17.1%
881006
 
13.5%
665295
 
10.8%
339069
 
6.5%
724281
 
4.0%
924281
 
4.0%
523005
 
3.8%
222088
 
3.7%
ValueCountFrequency (%)
r63550
25.0%
p31775
12.5%
a31775
12.5%
t31775
12.5%
i31775
12.5%
d31775
12.5%
e31775
12.5%
ValueCountFrequency (%)
-134669
100.0%
ValueCountFrequency (%)
A31775
100.0%
ValueCountFrequency (%)
125223
100.0%
ValueCountFrequency (%)
,29898
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common892016
75.7%
Latin285975
 
24.3%

Most frequent character per script

ValueCountFrequency (%)
1220423
24.7%
-134669
15.1%
125223
14.0%
0102778
11.5%
881006
 
9.1%
665295
 
7.3%
339069
 
4.4%
,29898
 
3.4%
724281
 
2.7%
924281
 
2.7%
Other values (2)45093
 
5.1%
ValueCountFrequency (%)
r63550
22.2%
A31775
11.1%
p31775
11.1%
a31775
11.1%
t31775
11.1%
i31775
11.1%
d31775
11.1%
e31775
11.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII1177991
100.0%

Most frequent character per block

ValueCountFrequency (%)
1220423
18.7%
-134669
11.4%
125223
10.6%
0102778
 
8.7%
881006
 
6.9%
665295
 
5.5%
r63550
 
5.4%
339069
 
3.3%
A31775
 
2.7%
p31775
 
2.7%
Other values (10)282428
24.0%

Término da vigÊncia
Categorical

HIGH CARDINALITY
MISSING

Distinct56
Distinct (%)0.1%
Missing181255
Missing (%)80.8%
Memory size1.7 MiB
01-09-2018
7860 
29-02-2016
4668 
01-04-2015
3762 
31-10-2015
3267 
31-12-2022
 
1980
Other values (51)
21489 

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters430260
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row01-01-2015
2nd row01-01-2015
3rd row03-12-2015
4th row01-01-2015
5th row01-01-2015
ValueCountFrequency (%)
01-09-20187860
 
3.5%
29-02-20164668
 
2.1%
01-04-20153762
 
1.7%
31-10-20153267
 
1.5%
31-12-20221980
 
0.9%
01-05-20151749
 
0.8%
19-09-20151683
 
0.8%
27-12-20151386
 
0.6%
21-08-20171305
 
0.6%
30-01-20161170
 
0.5%
Other values (46)14196
 
6.3%
(Missing)181255
80.8%
Histogram of lengths of the category
ValueCountFrequency (%)
01-09-20187860
18.3%
29-02-20164668
 
10.8%
01-04-20153762
 
8.7%
31-10-20153267
 
7.6%
31-12-20221980
 
4.6%
01-05-20151749
 
4.1%
19-09-20151683
 
3.9%
27-12-20151386
 
3.2%
21-08-20171305
 
3.0%
30-01-20161170
 
2.7%
Other values (46)14196
33.0%

Most occurring characters

ValueCountFrequency (%)
0104757
24.3%
-86052
20.0%
180268
18.7%
270620
16.4%
522452
 
5.2%
920706
 
4.8%
811553
 
2.7%
611430
 
2.7%
39780
 
2.3%
47398
 
1.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number344208
80.0%
Dash Punctuation86052
 
20.0%

Most frequent character per category

ValueCountFrequency (%)
0104757
30.4%
180268
23.3%
270620
20.5%
522452
 
6.5%
920706
 
6.0%
811553
 
3.4%
611430
 
3.3%
39780
 
2.8%
47398
 
2.1%
75244
 
1.5%
ValueCountFrequency (%)
-86052
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common430260
100.0%

Most frequent character per script

ValueCountFrequency (%)
0104757
24.3%
-86052
20.0%
180268
18.7%
270620
16.4%
522452
 
5.2%
920706
 
4.8%
811553
 
2.7%
611430
 
2.7%
39780
 
2.3%
47398
 
1.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII430260
100.0%

Most frequent character per block

ValueCountFrequency (%)
0104757
24.3%
-86052
20.0%
180268
18.7%
270620
16.4%
522452
 
5.2%
920706
 
4.8%
811553
 
2.7%
611430
 
2.7%
39780
 
2.3%
47398
 
1.7%
Distinct2
Distinct (%)< 0.1%
Missing65028
Missing (%)29.0%
Memory size1.7 MiB
SIM
97799 
NÃO
61454 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters477759
Distinct characters6
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNÃO
2nd rowNÃO
3rd rowSIM
4th rowSIM
5th rowNÃO
ValueCountFrequency (%)
SIM97799
43.6%
NÃO61454
27.4%
(Missing)65028
29.0%
Histogram of lengths of the category
ValueCountFrequency (%)
sim97799
61.4%
não61454
38.6%

Most occurring characters

ValueCountFrequency (%)
S97799
20.5%
I97799
20.5%
M97799
20.5%
N61454
12.9%
Ã61454
12.9%
O61454
12.9%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter477759
100.0%

Most frequent character per category

ValueCountFrequency (%)
S97799
20.5%
I97799
20.5%
M97799
20.5%
N61454
12.9%
Ã61454
12.9%
O61454
12.9%

Most occurring scripts

ValueCountFrequency (%)
Latin477759
100.0%

Most frequent character per script

ValueCountFrequency (%)
S97799
20.5%
I97799
20.5%
M97799
20.5%
N61454
12.9%
Ã61454
12.9%
O61454
12.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII416305
87.1%
None61454
 
12.9%

Most frequent character per block

ValueCountFrequency (%)
S97799
23.5%
I97799
23.5%
M97799
23.5%
N61454
14.8%
O61454
14.8%
ValueCountFrequency (%)
Ã61454
100.0%

Licença TV
Categorical

MISSING

Distinct2
Distinct (%)< 0.1%
Missing65028
Missing (%)29.0%
Memory size1.7 MiB
SIM
153672 
NÃO
 
5581

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters477759
Distinct characters6
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSIM
2nd rowSIM
3rd rowSIM
4th rowSIM
5th rowSIM
ValueCountFrequency (%)
SIM153672
68.5%
NÃO5581
 
2.5%
(Missing)65028
29.0%
Histogram of lengths of the category
ValueCountFrequency (%)
sim153672
96.5%
não5581
 
3.5%

Most occurring characters

ValueCountFrequency (%)
S153672
32.2%
I153672
32.2%
M153672
32.2%
N5581
 
1.2%
Ã5581
 
1.2%
O5581
 
1.2%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter477759
100.0%

Most frequent character per category

ValueCountFrequency (%)
S153672
32.2%
I153672
32.2%
M153672
32.2%
N5581
 
1.2%
Ã5581
 
1.2%
O5581
 
1.2%

Most occurring scripts

ValueCountFrequency (%)
Latin477759
100.0%

Most frequent character per script

ValueCountFrequency (%)
S153672
32.2%
I153672
32.2%
M153672
32.2%
N5581
 
1.2%
Ã5581
 
1.2%
O5581
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII472178
98.8%
None5581
 
1.2%

Most frequent character per block

ValueCountFrequency (%)
S153672
32.5%
I153672
32.5%
M153672
32.5%
N5581
 
1.2%
O5581
 
1.2%
ValueCountFrequency (%)
Ã5581
100.0%

Licença streaming
Categorical

MISSING

Distinct2
Distinct (%)< 0.1%
Missing65028
Missing (%)29.0%
Memory size1.7 MiB
SIM
154671 
NÃO
 
4582

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters477759
Distinct characters6
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSIM
2nd rowSIM
3rd rowSIM
4th rowSIM
5th rowSIM
ValueCountFrequency (%)
SIM154671
69.0%
NÃO4582
 
2.0%
(Missing)65028
29.0%
Histogram of lengths of the category
ValueCountFrequency (%)
sim154671
97.1%
não4582
 
2.9%

Most occurring characters

ValueCountFrequency (%)
S154671
32.4%
I154671
32.4%
M154671
32.4%
N4582
 
1.0%
Ã4582
 
1.0%
O4582
 
1.0%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter477759
100.0%

Most frequent character per category

ValueCountFrequency (%)
S154671
32.4%
I154671
32.4%
M154671
32.4%
N4582
 
1.0%
Ã4582
 
1.0%
O4582
 
1.0%

Most occurring scripts

ValueCountFrequency (%)
Latin477759
100.0%

Most frequent character per script

ValueCountFrequency (%)
S154671
32.4%
I154671
32.4%
M154671
32.4%
N4582
 
1.0%
Ã4582
 
1.0%
O4582
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII473177
99.0%
None4582
 
1.0%

Most frequent character per block

ValueCountFrequency (%)
S154671
32.7%
I154671
32.7%
M154671
32.7%
N4582
 
1.0%
O4582
 
1.0%
ValueCountFrequency (%)
Ã4582
100.0%

Licença VoD
Categorical

MISSING

Distinct2
Distinct (%)< 0.1%
Missing65028
Missing (%)29.0%
Memory size1.7 MiB
SIM
157957 
NÃO
 
1296

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters477759
Distinct characters6
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSIM
2nd rowSIM
3rd rowSIM
4th rowSIM
5th rowSIM
ValueCountFrequency (%)
SIM157957
70.4%
NÃO1296
 
0.6%
(Missing)65028
29.0%
Histogram of lengths of the category
ValueCountFrequency (%)
sim157957
99.2%
não1296
 
0.8%

Most occurring characters

ValueCountFrequency (%)
S157957
33.1%
I157957
33.1%
M157957
33.1%
N1296
 
0.3%
Ã1296
 
0.3%
O1296
 
0.3%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter477759
100.0%

Most frequent character per category

ValueCountFrequency (%)
S157957
33.1%
I157957
33.1%
M157957
33.1%
N1296
 
0.3%
Ã1296
 
0.3%
O1296
 
0.3%

Most occurring scripts

ValueCountFrequency (%)
Latin477759
100.0%

Most frequent character per script

ValueCountFrequency (%)
S157957
33.1%
I157957
33.1%
M157957
33.1%
N1296
 
0.3%
Ã1296
 
0.3%
O1296
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII476463
99.7%
None1296
 
0.3%

Most frequent character per block

ValueCountFrequency (%)
S157957
33.2%
I157957
33.2%
M157957
33.2%
N1296
 
0.3%
O1296
 
0.3%
ValueCountFrequency (%)
Ã1296
100.0%

Finalidade do vídeo
Categorical

MISSING

Distinct2
Distinct (%)0.1%
Missing222742
Missing (%)99.3%
Memory size1.7 MiB
Site
1506 
Revista
 
33

Length

Max length7
Median length4
Mean length4.064327485
Min length4

Characters and Unicode

Total characters6255
Distinct characters8
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSite
2nd rowRevista
3rd rowSite
4th rowSite
5th rowRevista
ValueCountFrequency (%)
Site1506
 
0.7%
Revista33
 
< 0.1%
(Missing)222742
99.3%
Histogram of lengths of the category
ValueCountFrequency (%)
site1506
97.9%
revista33
 
2.1%

Most occurring characters

ValueCountFrequency (%)
i1539
24.6%
t1539
24.6%
e1539
24.6%
S1506
24.1%
R33
 
0.5%
v33
 
0.5%
s33
 
0.5%
a33
 
0.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter4716
75.4%
Uppercase Letter1539
 
24.6%

Most frequent character per category

ValueCountFrequency (%)
i1539
32.6%
t1539
32.6%
e1539
32.6%
v33
 
0.7%
s33
 
0.7%
a33
 
0.7%
ValueCountFrequency (%)
S1506
97.9%
R33
 
2.1%

Most occurring scripts

ValueCountFrequency (%)
Latin6255
100.0%

Most frequent character per script

ValueCountFrequency (%)
i1539
24.6%
t1539
24.6%
e1539
24.6%
S1506
24.1%
R33
 
0.5%
v33
 
0.5%
s33
 
0.5%
a33
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII6255
100.0%

Most frequent character per block

ValueCountFrequency (%)
i1539
24.6%
t1539
24.6%
e1539
24.6%
S1506
24.1%
R33
 
0.5%
v33
 
0.5%
s33
 
0.5%
a33
 
0.5%

Tipo de vídeo
Categorical

MISSING

Distinct2
Distinct (%)0.2%
Missing223102
Missing (%)99.5%
Memory size1.7 MiB
Entrevista
1023 
Geral
156 

Length

Max length10
Median length10
Mean length9.338422392
Min length5

Characters and Unicode

Total characters11010
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowGeral
2nd rowEntrevista
3rd rowGeral
4th rowEntrevista
5th rowGeral
ValueCountFrequency (%)
Entrevista1023
 
0.5%
Geral156
 
0.1%
(Missing)223102
99.5%
Histogram of lengths of the category
ValueCountFrequency (%)
entrevista1023
86.8%
geral156
 
13.2%

Most occurring characters

ValueCountFrequency (%)
t2046
18.6%
e1179
10.7%
r1179
10.7%
a1179
10.7%
E1023
9.3%
n1023
9.3%
v1023
9.3%
i1023
9.3%
s1023
9.3%
G156
 
1.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter9831
89.3%
Uppercase Letter1179
 
10.7%

Most frequent character per category

ValueCountFrequency (%)
t2046
20.8%
e1179
12.0%
r1179
12.0%
a1179
12.0%
n1023
10.4%
v1023
10.4%
i1023
10.4%
s1023
10.4%
l156
 
1.6%
ValueCountFrequency (%)
E1023
86.8%
G156
 
13.2%

Most occurring scripts

ValueCountFrequency (%)
Latin11010
100.0%

Most frequent character per script

ValueCountFrequency (%)
t2046
18.6%
e1179
10.7%
r1179
10.7%
a1179
10.7%
E1023
9.3%
n1023
9.3%
v1023
9.3%
i1023
9.3%
s1023
9.3%
G156
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII11010
100.0%

Most frequent character per block

ValueCountFrequency (%)
t2046
18.6%
e1179
10.7%
r1179
10.7%
a1179
10.7%
E1023
9.3%
n1023
9.3%
v1023
9.3%
i1023
9.3%
s1023
9.3%
G156
 
1.4%

Estado
Categorical

CONSTANT
REJECTED

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.7 MiB
CONSOLIDADO
224281 

Length

Max length11
Median length11
Mean length11
Min length11

Characters and Unicode

Total characters2467091
Distinct characters8
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowCONSOLIDADO
2nd rowCONSOLIDADO
3rd rowCONSOLIDADO
4th rowCONSOLIDADO
5th rowCONSOLIDADO
ValueCountFrequency (%)
CONSOLIDADO224281
100.0%
Histogram of lengths of the category
ValueCountFrequency (%)
consolidado224281
100.0%

Most occurring characters

ValueCountFrequency (%)
O672843
27.3%
D448562
18.2%
C224281
 
9.1%
N224281
 
9.1%
S224281
 
9.1%
L224281
 
9.1%
I224281
 
9.1%
A224281
 
9.1%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter2467091
100.0%

Most frequent character per category

ValueCountFrequency (%)
O672843
27.3%
D448562
18.2%
C224281
 
9.1%
N224281
 
9.1%
S224281
 
9.1%
L224281
 
9.1%
I224281
 
9.1%
A224281
 
9.1%

Most occurring scripts

ValueCountFrequency (%)
Latin2467091
100.0%

Most frequent character per script

ValueCountFrequency (%)
O672843
27.3%
D448562
18.2%
C224281
 
9.1%
N224281
 
9.1%
S224281
 
9.1%
L224281
 
9.1%
I224281
 
9.1%
A224281
 
9.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII2467091
100.0%

Most frequent character per block

ValueCountFrequency (%)
O672843
27.3%
D448562
18.2%
C224281
 
9.1%
N224281
 
9.1%
S224281
 
9.1%
L224281
 
9.1%
I224281
 
9.1%
A224281
 
9.1%

Visualizações
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct2248
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean91.34460342
Minimum0
Maximum3502627
Zeros111000
Zeros (%)49.5%
Negative0
Negative (%)0.0%
Memory size1.7 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q322
95-th percentile240
Maximum3502627
Range3502627
Interquartile range (IQR)22

Descriptive statistics

Standard deviation8190.192551
Coefficient of variation (CV)89.66257714
Kurtosis151539.6557
Mean91.34460342
Median Absolute Deviation (MAD)1
Skewness369.0234795
Sum20486859
Variance67079254.02
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0111000
49.5%
18357
 
3.7%
26021
 
2.7%
34706
 
2.1%
43875
 
1.7%
53244
 
1.4%
62986
 
1.3%
72664
 
1.2%
82529
 
1.1%
92392
 
1.1%
Other values (2238)76507
34.1%
ValueCountFrequency (%)
0111000
49.5%
18357
 
3.7%
26021
 
2.7%
34706
 
2.1%
43875
 
1.7%
ValueCountFrequency (%)
35026271
< 0.1%
10751051
< 0.1%
9587021
< 0.1%
7047191
< 0.1%
2003321
< 0.1%

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
The dendrogram allows you to more fully correlate variable completion, revealing trends deeper than the pairwise ones visible in the correlation heatmap.

Sample

First rows

DataMêsAnoTítuloNome do programaNúmero do programaData de registroData de publicaçãoDuraçãoStatus de visualizaçãoSérieNúmero do episódioÁreas temáticasEtapas de ensinoPúblicos-alvoMECFlixMEC REDDisp. TV Escola CriançasInéditoLIBRASFunção do vídeoVersão brasileiraClassificação indicativaTipo de produçãoAno de produçãoPaís de origemData primeira exibiçãoFaixa etáriaTérmino da vigÊnciaVisualização sem autenticaçãoLicença TVLicença streamingLicença VoDFinalidade do vídeoTipo de vídeoEstadoVisualizações
02014-01-01Janeiro2014TESTE TVoD upload via base integradoraNaNNaN23-08-201316-12-201400:00:23NaNNaNNaNDiversidade Cultural, FísicaGeralPúblico em geralNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNCONSOLIDADO0
12014-01-01Janeiro2014Teste TVoD Habitantes de BabelNaNNaN26-08-201302-01-201400:26:26NaNNaNNaNDiversidade CulturalGeralPúblico em geralNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNCONSOLIDADO0
22014-01-01Janeiro2014Música 005NaNNaN16-04-201424-06-201400:22:04NaNNaNNaNMúsicaEnsino Fundamental IIPúblico em geralNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNCONSOLIDADO0
32014-01-01Janeiro2014Música 005NaNNaN16-04-201424-06-201400:27:08NaNNaNNaNMúsicaGeralPúblico em geralNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNCONSOLIDADO0
42014-01-01Janeiro2014Alimentação - Com librasNaNNaN16-04-201410-06-201400:26:50NaNNaNNaNArtes, Matemática, SaúdeEnsino Fundamental IAlunoNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNCONSOLIDADO0
52014-01-01Janeiro2014Esporte - Com librasNaNNaN16-04-201410-06-201400:26:45NaNNaNNaNEducação Física, Matemática, SaúdeEnsino Fundamental IAlunoNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNCONSOLIDADO0
62014-01-01Janeiro2014Canção do Monstro OzoNaNNaN22-04-201402-12-201400:00:41NaNNaNNaNMúsicaEducação InfantilPúblico em geralNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNCONSOLIDADO0
72014-01-01Janeiro2014Canção das LetrasNaNNaN22-04-201422-04-201400:00:40NaNNaNNaNLíngua Portuguesa, MúsicaCiclo de Alfabetização, Educação InfantilPúblico em geralNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNCONSOLIDADO0
82014-01-01Janeiro2014Canção da Terra da FertilidadeNaNNaN22-04-201422-04-201400:00:36NaNNaNNaNLíngua Portuguesa, MúsicaCiclo de Alfabetização, Educação InfantilPúblico em geralNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNCONSOLIDADO0
92014-01-01Janeiro2014Canção das PlacasNaNNaN22-04-201422-04-201400:00:41NaNNaNNaNLíngua Portuguesa, MúsicaCiclo de Alfabetização, Educação InfantilPúblico em geralNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNCONSOLIDADO0

Last rows

DataMêsAnoTítuloNome do programaNúmero do programaData de registroData de publicaçãoDuraçãoStatus de visualizaçãoSérieNúmero do episódioÁreas temáticasEtapas de ensinoPúblicos-alvoMECFlixMEC REDDisp. TV Escola CriançasInéditoLIBRASFunção do vídeoVersão brasileiraClassificação indicativaTipo de produçãoAno de produçãoPaís de origemData primeira exibiçãoFaixa etáriaTérmino da vigÊnciaVisualização sem autenticaçãoLicença TVLicença streamingLicença VoDFinalidade do vídeoTipo de vídeoEstadoVisualizações
2242712020-01-01Janeiro2020A grande corridaZOU043 A GRANDE CORRIDA43.002-07-201905-09-201900:11:33LiberadoZOU43.0Ciências, Meio Ambiente, Ética, ArtesEducação InfantilPúblico em geral, Professor, AlunoSIMSIMSIMNÃONÃOProgramaDubladoLivreExterno2012.0FrançaNaN03-06NaNNÃOSIMSIMSIMNaNNaNCONSOLIDADO0
2242722020-01-01Janeiro2020O aquário do ZouZOU044 O AQUARIO DO ZOU44.003-07-201905-09-201900:11:44LiberadoZOU44.0Ciências, Meio Ambiente, Ética, ArtesEducação InfantilPúblico em geral, Professor, AlunoSIMSIMSIMNÃONÃOProgramaDubladoLivreExterno2012.0FrançaNaN03-06NaNNÃOSIMSIMSIMNaNNaNCONSOLIDADO0
2242732020-01-01Janeiro2020O robô do ZouZOU045 O ROBO DO ZOU45.003-07-201905-09-201900:11:41LiberadoZOU45.0Ciências, Meio Ambiente, Ética, ArtesEducação InfantilPúblico em geral, Professor, AlunoSIMSIMSIMNÃONÃOProgramaDubladoLivreExterno2012.0FrançaNaN03-06NaNNÃOSIMSIMSIMNaNNaNCONSOLIDADO0
2242742020-01-01Janeiro2020A viagem de ZouZOU046 A VIAGEM DE ZOU46.003-07-201905-09-201900:11:42LiberadoZOU46.0Ciências, Meio Ambiente, Ética, ArtesEducação InfantilPúblico em geral, Professor, AlunoSIMSIMSIMNÃONÃOProgramaDubladoLivreExterno2012.0FrançaNaN03-06NaNNÃOSIMSIMSIMNaNNaNCONSOLIDADO0
2242752020-01-01Janeiro2020Piratas e fadasZOU047 PIRATAS E FADAS47.003-07-201905-09-201900:11:26LiberadoZOU47.0Ciências, Meio Ambiente, Ética, ArtesEducação InfantilPúblico em geral, Professor, AlunoSIMSIMSIMNÃONÃOProgramaDubladoLivreExterno2012.0FrançaNaN03-06NaNNÃOSIMSIMSIMNaNNaNCONSOLIDADO1
2242762020-01-01Janeiro2020Zou e o balãoZOU048 ZOU E O BALAO48.003-07-201905-09-201900:11:34LiberadoZOU48.0Ciências, Meio Ambiente, Ética, ArtesEducação InfantilPúblico em geral, Professor, AlunoSIMSIMSIMNÃONÃOProgramaDubladoLivreExterno2012.0FrançaNaN03-06NaNNÃOSIMSIMSIMNaNNaNCONSOLIDADO0
2242772020-01-01Janeiro2020Zou, o palhaçoZOU049 O PALHACO49.003-07-201905-09-201900:11:57LiberadoZOU49.0Ciências, Meio Ambiente, Ética, ArtesEducação InfantilPúblico em geral, Professor, AlunoSIMSIMSIMNÃONÃOProgramaDubladoLivreExterno2012.0FrançaNaN03-06NaNNÃOSIMSIMSIMNaNNaNCONSOLIDADO3
2242782020-01-01Janeiro2020Zou e o coelhinho da páscoaZOU050 O COELHINHO DA PASCOA50.003-07-201905-09-201900:11:41LiberadoZOU50.0Ciências, Meio Ambiente, Ética, ArtesEducação InfantilPúblico em geral, Professor, AlunoSIMSIMSIMNÃONÃOProgramaDubladoLivreExterno2012.0FrançaNaN03-06NaNNÃOSIMSIMSIMNaNNaNCONSOLIDADO3
2242792020-01-01Janeiro2020Zou e a queda de energiaZOU051 A QUEDA DE ENERGIA51.003-07-201905-09-201900:11:35LiberadoZOU51.0Ciências, Meio Ambiente, Ética, ArtesEducação InfantilPúblico em geral, Professor, AlunoSIMSIMSIMNÃONÃOProgramaDubladoLivreExterno2012.0FrançaNaN03-06NaNNÃOSIMSIMSIMNaNNaNCONSOLIDADO0
2242802020-01-01Janeiro2020A confissão do ZouZOU052 A CONFISSAO DO ZOU52.003-07-201905-09-201900:11:42LiberadoZOU52.0Ciências, Meio Ambiente, Ética, ArtesEducação InfantilPúblico em geral, Professor, AlunoSIMSIMSIMNÃONÃOProgramaDubladoLivreExterno2012.0FrançaNaN03-06NaNNÃOSIMSIMSIMNaNNaNCONSOLIDADO4